# Real-time Image Captioning
Moondream 2b 2025 04 14 4bit
Apache-2.0
Moondream is a lightweight vision-language model designed for efficient cross-platform deployment. The 4-bit quantized version, released on April 14, 2025, substantially reduces memory usage while maintaining high accuracy.
Image-to-Text
Safetensors
moondream
6,037
38
Clip Gpt2 Finetuned
A fine-tuned version of CLIP-GPT2 for real-time image captioning, designed to help visually impaired people understand image content.
Image-to-Text
Transformers

vidi-deshp
18
0
Moondream2 Llamafile
Apache-2.0
moondream2 is a compact vision-language model designed to run efficiently on edge devices. It is distributed in the llamafile format for convenient deployment.
Image-to-Text
cjpais
310
30